Indexing Units

Author

  • Jaap Kamps
Abstract

DEFINITION: The term indexing units refers to the granularity of information in the retrieval system's index. In principle, an indexing unit can be any part of a structured text document, and the choice of indexing units therefore determines the possible units of retrieval. There are three basic approaches. The first is to index every potentially retrievable unit as a whole, the so-called element-based approach [13]. The second is to index disjoint nodes and to rely on aggregation or score propagation methods for scoring higher-level nodes [e.g., 1, 12]. The third is to index only selected elements, for example by indexing particular element types in separate indexes [10]. Various mixtures of these approaches have also been applied.
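
The three approaches can be illustrated on a toy XML document. The following is a minimal sketch, not the method of any of the cited works: the sample document, the function names (`element_based`, `disjoint_nodes`, `propagate`, `selective`), the raw term-frequency scoring, and the `decay` factor used for score propagation are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical sample document, invented for this sketch.
DOC = """<article>
  <title>Indexing units</title>
  <sec><p>element retrieval</p><p>index granularity</p></sec>
  <sec><p>score propagation</p></sec>
</article>"""


def tokens(text):
    """Lowercase whitespace tokenizer; None is treated as empty text."""
    return (text or "").lower().split()


def paths(root):
    """Yield (path, element) for every element, in document order."""
    def walk(elem, path):
        yield path, elem
        seen = Counter()
        for child in elem:
            seen[child.tag] += 1
            yield from walk(child, f"{path}/{child.tag}[{seen[child.tag]}]")
    yield from walk(root, f"/{root.tag}[1]")


def own_text(elem):
    """Text directly inside elem, excluding text inside its children."""
    return " ".join([elem.text or ""] + [child.tail or "" for child in elem])


def element_based(root):
    # Approach 1: index every element with all the text it contains,
    # so the units overlap (a section repeats its paragraphs' terms).
    return {p: Counter(tokens(" ".join(e.itertext()))) for p, e in paths(root)}


def disjoint_nodes(root):
    # Approach 2: index each element with its own direct text only,
    # so every term occurrence is stored exactly once.
    index = {}
    for p, e in paths(root):
        terms = tokens(own_text(e))
        if terms:
            index[p] = Counter(terms)
    return index


def propagate(index, query, decay=0.5):
    # Assumed propagation scheme: a unit's score is passed to each
    # ancestor, multiplied by `decay` for every level it moves up.
    scores = Counter()
    for path, unit in index.items():
        score = sum(unit[t] for t in tokens(query))
        steps = path.split("/")[1:]
        for depth in range(len(steps), 0, -1):
            ancestor = "/" + "/".join(steps[:depth])
            scores[ancestor] += score * decay ** (len(steps) - depth)
    return scores


def selective(root, tags=("title", "sec")):
    # Approach 3: index only selected element types, one index per type.
    index = {tag: {} for tag in tags}
    for p, e in paths(root):
        if e.tag in tags:
            index[e.tag][p] = Counter(tokens(" ".join(e.itertext())))
    return index


root = ET.fromstring(DOC)
full_index = element_based(root)
leaf_index = disjoint_nodes(root)
scores = propagate(leaf_index, "retrieval")
typed_index = selective(root)
```

In this sketch the element-based index stores the same terms at several levels, which makes any element directly retrievable at the cost of redundancy. The disjoint index stores each term once and recovers scores for `sec` and `article` only at query time through `propagate`.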

Similar resources

Indexing Millions of Packets per Second using GPUs

Network traffic loggers are devices that record a recent window of the entire traffic in one or more network links. The traffic is stored in packet repositories that enable retrospective analyses, e.g., for forensic investigation. Traffic loggers deployed over very high-speed networks must process and store millions of packets per second using commodity hardware. To enable interactive explorati...

Exploring Evidence Aggregation Methods and External Expansion Sources for Medical Record Search

1. Evidence Aggregation: In last year’s track there were two different methods in general for obtaining a visit ranking out of reports (smaller document units), i.e., (A) using reports as indexing and retrieval units and then converting a report ranking into a visit ranking, and (B) using visits as indexing and retrieval units by concatenating reports at the very first stage and then obtain a v...

Embodied Methods to Improve Children’s Memory

Over the last decade, embodied cognition, the idea that sensorimotor processes facilitate higher cognitive processes, has proven useful for improving children's memory for a story. In order to compare the benefits of two embodiment techniques, Active Experiencing (AE) and indexing, for children's memory for a story, we compared the immediate recall of different types of idea units across three ...

Syntactic Approaches to Automatic Book Indexing

Automatic book indexing systems are based on the generation of phrase structures capable of reflecting text content. Some approaches are given for the automatic construction of back-of-book indexes using a syntactic analysis of the available texts, followed by the identification of nominal constructions, the assignment of importance weights to the term phrases, and the choice of phrases as in...

Concept-based semantic annotation, indexing and retrieval of office-like document units

We present an ontology-driven approach to semantic annotation, indexing and retrieval of document units. This approach is based on a novel semantic document model (SDM) that we developed to make office-like document units be uniquely identified, semantically annotated with concepts from annotation ontologies and linkable across document boundaries. In the semantic annotation model that we propo...

Publication date: 2009